Mining Co-Change Clusters from Version Repositories

Authors

  • Dirk Beyer
  • Andreas Noack
Abstract

Clusters of software artifacts that are frequently changed together are subsystem candidates, because one of the main goals of software design is to make changes local. The contribution of this paper is a visualization-based method that supports the identification of such clusters. First, we define the co-change graph as a simple but powerful model of common changes of software artifacts, and describe how to extract the graph from version control repositories. Second, we introduce an energy model for computing force-directed layouts of co-change graphs. The resulting layouts have a well-defined interpretation in terms of the structure of the visualized graph, and clearly reveal groups of frequently co-changed artifacts. We evaluate our method by comparing the layouts for three example projects with authoritative subsystem decompositions.

Classification: D.2.7 Distribution, Maintenance, and Enhancement (Restructuring, reverse engineering, and reengineering; Version control), G.2.2 Graph Theory, I.5.3 Clustering
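
The idea of artifacts that are "frequently changed together" can be illustrated with a minimal sketch. It is not the paper's co-change graph definition or its extraction procedure, which the abstract does not spell out. It assumes that each change transaction (for example, one commit) is already available as a list of file names, and it simply counts how often each pair of files appears in the same transaction. The function name `co_change_counts` and the sample file names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def co_change_counts(transactions):
    """Count, for every unordered pair of artifacts, the number of
    transactions in which both artifacts were changed together.

    `transactions` is an iterable of iterables of artifact names.
    This pair-counting model is an assumption made for illustration.
    """
    counts = Counter()
    for files in transactions:
        # Deduplicate and sort so each pair has one canonical key.
        for a, b in combinations(sorted(set(files)), 2):
            counts[(a, b)] += 1
    return counts


# Hypothetical change history: each inner list is one transaction.
transactions = [
    ["parser.c", "lexer.c"],
    ["parser.c", "lexer.c", "README"],
    ["gui.c", "widgets.c"],
    ["gui.c", "widgets.c"],
    ["parser.c", "gui.c"],
]

counts = co_change_counts(transactions)
for pair, n in counts.most_common():
    print(pair, n)
```

In this toy history, the pairs with the highest counts (`lexer.c` with `parser.c`, and `gui.c` with `widgets.c`) are the kind of groups that a layout of frequently co-changed artifacts would be expected to place close together.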

Related resources

Mining evolutionary dependencies from web-localization repositories

An approach to mining repositories of web-based user documentation for patterns of evolutionary change in the context of internationalization and localization is presented. Localized web documents that are frequently co-changed (i.e., an evolutionary dependency) during the natural language translation process are uncovered to support the future evolution of the system. A sequential-pattern mini...

A Survey on Mining Software Repositories

This paper presents fundamental concepts, overall process and recent research issues of Mining Software Repositories. The data sources such as source control systems, bug tracking systems or archived communications, data types and techniques used for general MSR problems are also presented. Finally, evaluation approaches, opportunities and challenge issues are given. key words: mining, software...

Mining Container Image Repositories for Software Configuration and Beyond

This paper introduces the idea of mining container image repositories for configuration and other deployment information of software systems. Unlike traditional software repositories (e.g., source code repositories and app stores), image repositories encapsulate the entire execution ecosystem for running target software, including its configurations, dependent libraries and components, and OS-l...

Software Repositories: A Source for Traceability Links

This paper analyzes six open source projects in order to assess software repositories, such as those managed by Subversion, as a source for uncovering/discovering traceability links between different types of software artifacts. Our finding suggests that software repositories store a variety of artifacts that are central to open source development and use. Furthermore, a heuristic-based approac...

ModularityCheck: A Tool for Assessing Modularity using Co-Change Clusters

It is widely accepted that traditional modular structures suffer from the dominant decomposition problem. Therefore, to improve current modularity views, it is important to investigate the impact of design decisions concerning modularity in other dimensions, as the evolutionary view. In this paper, we propose the ModularityCheck tool to assess package modularity using co-change clusters, which ...

Publication date: 2005